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Abstract 

The linear discriminant analysis (LDA) method is a classical and commonly utilized technique for dimensionality reduction 
and classification in brain-computer interface (BCI) systems. Being a first-order discriminator, LDA is usually preceded by the 
feature extraction of electroencephalogram (EEG) signals, as multi-density EEG data are of second order. In this study, an 
analytic bilinear classification method which inherits and extends LDA is proposed. This method considers 2-dimentional 
EEG signals as the feature input and performs classification using the optimized complex-valued bilinear projections. 
Without being transformed into frequency domain, the complex-valued bilinear projections essentially spatially and 
temporally modulate the phases and magnitudes of slow event-related potentials (ERPs) elicited by distinct brain states in 
the sense that they become more separable. The results show that the proposed method has demonstrated its 
discriminating capability in the development of a rapid image triage (RIT) system, which is a challenging variant of BCIs due 
to the fast presentation speed and consequently overlapping of ERPs. 
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introduction 

A rapid development of brain-computer interface (BCI) related 
techniques has been seen in the past years. BCI system utilizing 
electroencephalogram (EEG) provides a shortcut of communica- 
tion channel between the human brain and an external device, 
without conventional human's physical response. Therefore, BCIs 
could be the goodwill for physically disabled patients as the 
promising neuroprosthetics solutions [1-3]. In addition, due to the 
advances in computation and communication, BCIs enable the 
new concept of gaming [4] and augment people's performance in 
some applications, one of which is prioritizing images from an 
image pool. Fast search of target images (objects) in large-volume 
imagery, e.g. aerial imagery, has come to a bottleneck. That is, 
limit number of skillful image analysts cannot handle the 
increasing volume of imagery in the conventional way. Recently, 
the rapid image triage (RIT) technique which leverages human 
vision, split-second judgement capability and machine learning for 
EEG signal processing, has proven to be a promising solution by 
researchers [5-9] . It can be applied in various applications such as 
satellite image analysis and image retrieval task. 

In one type of RIT, a large-scale imagery is chopped into a 
number of images of smaller sizes. These images are then 
presented to an image analyst in a sequential order at a fast 
speed, which is called rapid serial visual presentation (RSVP) 
paradigm [5,9,10]. During the RIT, amongst the images, some 
contain objects that are perceived as target objects, and hence are 
required to be identified for further detailed analysis [11]. In 
contrast, other images being irrelevant to the searching task are 



considered as nontargets to be disregarded. The occurrence of 
targets is so rare and infrequent that the searching process wiU 
induce the oddball effect [12,13]. That is, the unique event-related 
potentials (ERPs) measurable on the scalp wiU be elicited by a 
recognized target image. Among these unique target ERPs, the 
P300 which is a prominent positive voltage deflection peaking 
around 300 ms after the onset of the target is the major 
component differing from nontarget ERPs. Hence, the backbone 
of RIT system is to detect, identify and dissociate these two types 
of ERPs. 

Since ERPs are usually overwhelmed by noise such as 
background EEG, it is necessary to resort to various signal 
processing methods to improve the signal-to-noise ratio (SNR). 
Some of the methods are spatial decomposition based, such as 
principal component analysis (PC A) [14], independent component 
analysis (ICA) [15] and common spatial pattern analysis (CSP) 
[16]. Compared to PCA and ICA which extract uncorrelated/ 
independent components, CSP is naturally more suitable for the 
binary classification task, as it extremizes the ratio of temporal 
variance of one condition over the other condition. It has been 
successfully applied in abundant BCI appKcations, including the 
motor imagery [17,18], vowel speech imagery [19] and RIT [20]. 
In order to compensate the deficiency due to the temporal 
invariance, a number of CSP variants have been introduced. 
There are CSP variants that make use of spectral filters in 
conjunction with spatial filters, such as common spatio-spectral 
pattern (CSSP) [21], the common sparse spectral spatial pattern 
(CSSSP) [22] and spectrally weighted common spatial patterns 
(SPEC-CSP) [23]. There are also methods looking for spatio- 



PLOS ONE I www.plosone.org 



1 



June 2014 | Volume 9 | Issue 6 | e100097 



Analytic Bilinear Discrimination of EEG Signals 



temporal projections instead of pure spatial filters [8,9,24]. Being 
different from spectral filtering and temporal filtering, the recendy 
proposed CSP variant, namely analytic common spatial patterns 
(ACSP), emphasizes on the modulation of the phases of EEG 
signals [25,26]. It is accomplished by performing the spatial 
filtering in the complex-valued space, where the phase information 
is stiU preserved. 

The phase of EEG signals is of rich unexplored implications. 
For instance, it has been suggested that the trial-by-trial variability 
of performance could be partially attributed to the fluctuation of 
the phase of ongoing oscillations, and the measurement of EEG 
phase might be useful for the prediction of perceptual and 
attentional variability [27]. The pre-stimulus EEG phase was 
claimed to affect the magnitude of the following auditory ERPs 
[28]. Moreover, phase is also an important property in steady-state 
visual evoked potentials (SSVEPs) [29]. In the context of RIT, the 
Unkage between ERPs and phase is not apparent. However, the 
modulation of phase can change the morphology of ERPs in terms 
of magnitude and latency. And the RS VP paradigm resembles the 
setting of SSVEP as images are shown at specific frequency. 
However, it is noteworthy that spatial phase modulation proposed 
by ACSP, which has shown superior in classification problems 
such as oscillatory EEG [25] and SSVEP [26], may not be optimal 
in the scenario of RIT. In RIT, the target ERPs are slow potentials 
and can be overlapping with a number of nontarget ERPs due to 
the presentation speed. Therefore, a spatio-temporal phase 
modulation can be more useful in RIT. 

In this paper, a method namely analytic bilinear discriminant 
analysis (ABDA) is proposed to address the spatio-temporal 
modulation and classification in RIT. The proposed ABDA 
method belongs to the linear discriminant analysis (LDA) variants, 
which is a traditional dimension reduction and classification 
method for low-dimensional feature vector. It finds the projection 
that maximizes the ratio of between-scatter matrix over within- 
scatter matrix. However, LDA becomes insufficient for handling 
2D images as well as high-density EEG signals [30]. Therefore, 
2D-LDA was proposed to be adapted for 2D matrbc, which derives 
a set of orthonormal projections [31]. On the other hand, bilinear 
discriminant analysis (BDA) [32] and 2-dimensional linear 
discriminant analysis (2DLDA) [33] extend LDA by iteratively 
optimizing bilinear projections instead of one LDA projection. 
There are also trilinear (or even higher dimensional) methods, 
such as parallel factor analysis (PARAFAC) [34] and general 
tensor discriminant analysis (GTDA) [35]. These methods usually 
require the transformation of EEG signals into the frequency 
domain. Moreover, recent advances attempt to address problems 
like the limited sample size and doubtful distribution assumption. 
For instance, a variant of LDA namely enhanced Bayesian LDA 
(EBLDA), enlarges the sample size by incorporating unlabeled 
data with high probability into labeled data to refine the 
classification [36] . In addition, Z-LDA adaptively adjusts decision 
boundary to accommodate the heteroscedastic signal distribution 
through z-score [37]. Compared to other methods, the uniqueness 
of the proposed ABDA method mainly resides in: 1) exploiting the 
complex-valued bilinear projections, i.e. spatial and temporal 
projections in a complex-valued space-time domain; 2) The phases 
of slow ERPs are both spatially and temporally modulated, which 
is useful in the context of discriminating ERPs that are 
overlapping; 3) the spatio-temporal phase modulation demon- 
strated by ABDA accounts for the spatio-temporal propagation of 
slow ERPs. The method evaluation is conducted in the context of 
RIT experiments by comparing it with several competitive 
methods. 



Methods 

2.1 Analytic Presentation 

For a real-valued signabc(?), the corresponding analytic signal 
a{t) is presented as 



ait) = x(t)+jH(x)(t) 



(1) 



where the imaginary part, H(x)(/), is the Hilbert transform of 
The Hilbert transform of x{t) can be given by 



Hix)(t)=-p.v. 



•x(t) 



dx 



(2) 



where p.v. stands for the Cauchy principal value. The effect of 
Hilbert transform is to shift the phases of both negative frequency 
components and positive frequency components of a signal, but in 
different directions, i.e. +90° and — 90% respectively. In addition, 
j introduces another phase shift of-|-90° toH(x)(?). The ultimate 
effect is that, the negative frequency components of the analytic 
signal x{t) are shifted above 0 Hz. In other words, «(?) contains 
only positive frequency components. It is worth noting that the 
phases of positive frequency components of a(?) are the equivalent 
to the counterparts of 

2.2 Objective Function 

Given the band-pass filtered EEG epochs Xi and X2 
(channel X time) under two conditions ("1" for target condition 
and "2" for nontarget condition), the corresponding analytic 
presentations are denoted as Ai and A2, respectively. There are 
two bilinear projections, i.e. complex-valued spatial projection w 
and temporal projection v. The within-class scatter S„, and 
between-class scatter S^, of analytic EEG epochs after temporal 
projecting using v can be given as 



2 lie 



S.= ^^(A>-A,v)(A^v-A,v)* 

C=l !=1 

Si = (Ai V — A2 v)(Ai V - A2 v)* 



(3) 



where A^ and A^ stands for the epoch and the mean matrix 
under condition c, respectively. And ■* represents conjugate 
transpose operator. 

Similarly to LDA, the objective of ABDA can be written as 



7(w,w * ,v,v * ) = Argmax 



w * S„w 



(4) 



It has been shown that there is no analytical solution to a real- 
valued biquadratic equation [8,32], which is a similar case for a 
complex-valued biquadratic equation like (4). However, there 
exists a sub-optimal solution, using the iterative learning. 

Suppose that v is already given, e.g. v is initialized to be an 
identity matrix in this work. Then S„, and Si become also known 
and (4) can be solved by calculating the derivation by the complex- 
valued w*. Specifically, (4) can be rewritten as 



/(w,w * )w * S,„w = w * SftW 



(5) 



In order to maximize /(w,w * ), dJ/dw and dJ/ 5w* shall be set to 
zero. Since dJ/dw and dJ/dw* are complex conjugate transpose 
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Figure 1 . The experimental paradigm. Consecutive bursts each of which contained 50 images were serially presented and separated by a resting 
period. Some of the bursts contained target images. 
doi:1 0.1 371 /journal.pone.01 00097.g001 



of each other, only one of them, e.g. dj/dw*, needs to be 
calculated: 



8J_ 
8w* 



w * S„.w + 2J(w,w * )S,i.w = 2SftW 



Let dJ/dw* be zero, (6) can be further simplified to 



S„, S/,w = /w 



(6) 



(7) 



/(w,w * ,v,v * ) = Argmax 



V * SiV 

V * S„,v 



(10) 



By inserting (9) into (10) and inserting (3) into (4), it can be 
shown that (10) and (4) are actually equivalent. Therefore, after w 
is obtained according to (8), (10) can be used to derive V similarly 
by letting 8J/d\* be zero, which is 



(7) is exactly the same to the eigenvalue problem in LDA, the 
solution to which wiU be 



w = S„, (Aiv— A2V) 



(8) 



On the other hand, the within-class scatter S„, and between- 
class scatter S/, of analytic EEG epochs after spatial projectmg 
using w can be expressed as 



2 lie 

S„, = (A^ * w — Ac * w)(A^, * w — Ac * w)* 

f=i /=i 

S/, = (Ai * w — A2 * w)(Ai * w — A2 * w)* 
And the corresponding objective function is 



(9) 



v = S„, (Ai*w — A2*w) 



(11) 



By iteratively using (8) and (11), the optimal complex-valued 
bilinear projections w and \ can be obtained when the ratio of 
between-class scatter to within-class scatter, i.e. /(w,w * ,V,v * ), 
converges. 

It is noteworthy that, as w, v and A are all complex-valued, the 
projected data, i.e. /) = w*Av, is also a complex value, which 
however cannot be directiy used to get decision boundary. Here, p 
is decomposed to a 2-element vector p, the elements of which are 
the real part and imaginary part of^. The classical LDA method is 
used to find a projection z that separates Pj from P2. 

2.3 Classifier 

Unlike the LDA which handles the m-dimension to one- 
dimension discrimination issue [38], the proposed ABDA tackles 
the m X n-dimensional classification problem. The classifier makes 
use of bUinear projections, LDA projection and the biasZ), as 
indicated in 
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p = w* [X + H(X)]v 

>' = z' [rea/ (/))imag(/7)] ' + h 



(12) 



where •' denotes the transpose operator. Moreover, real(p) and 
imag(/7) represent the real part and imaginary part of /?, 
respectively. 

Generally, the bias b shall be chosen such that the posteriors in 
the projected dimension of two conditions will be equal [38]. 
Appropriate selection of b could be important if the sample size of 
one condition is significantly different from that of the other, 
which is the unbalanced classification problem. 

Experimental Setup 

RIT experiments were conducted under the approval by die 
National University of Singapore Institutional Review Board 
(NUS-IRB). After providing written consent forms for participat- 
ing in the experiment, 22 healthy participants, all right-handed, 
with normal or corrected-to-normal sight, completed the RIT 
experiments. 

3.1 Experiment Design 

The experiment included one training session and one testing 
session. In each session, aerial images were sequentially presented 
to a participant, following the standard RSVP paradigm [5,9,10] 
(see Figure 1). Each image lasted for 150 ms on the centre of the 
screen and then was replaced by the next one. There was a 
temporary break between every 50 images. The duration of the 
break was self-controlled by the participant but was caped to 
10 seconds. These aerial images were of 400x400 pixels. A small 
amount of them (approximately 72) containing objects of interest 
were defined as targets, while others (over 4400) were considered 
as nontargets. The participant was informed that he/she should 
neglect nontargets but was obliged to immediately respond to the 
appearing targets by pressing a button. 

3.2 Acquisition and Preprocessing 

For every participant, 62-channel EEG signals were collected at 
250 Hz, using an ANT amplifier (ANT B.B., Enschede, Nether- 
lands). EEG signals were referenced to linked ears and grounded 
to the forehead. The 4* order Butterworth fdter was adopted, with 
the pass-band from 1 Hz to 25 Hz. The filtered signals were 
segmented into epochs, the time window of which starts from the 
onset of each image to 500 ms after the onset. 

In RIT experiments, sometimes there could be a few bad 
channels (malfunctioning channels) and these bad channels might 
deteriorate the performance. Hence, ahead of training the 
classifier using data collected in training session, bad channels 
were automatically identified, which would be excluded from both 
training data and testing data. This was accomplished by 
monitoring each channel across all epochs in training session. 
For instance, if the absolute difference between the maximum 
value and the minimum value (or the mean value) is significantiy 
large for a particular channel over 30% of total epochs, this 
channel would be labeled as the bad channel. 

It is worth noting that the removal of identified bad channels 
was based on training data. However, there could be more bad 
channels in testing session. Thus, an additional measure was 
introduced. That is, every epoch was examined whether there 
were any suspected abnormal channels. These suspected channels 
would be replaced by spherical spline interpolation using 
neighboring functioning channels [39] . 
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Iteration Iteration 

Figure 2. The ratios of between-class scatter to within-class scatter with respect to the iteration steps for 22 participants. 

doi:1 0.1 371 /journal.pone.01 00097.g002 



Evaluation 

With complex-valued bilinear projections, ABDA is assumed to 
be able to modulate the phases and the magnitudes of signals in 
the manner that target ERPs and nontarget ERPs become more 
differentiable in the context of RIT. This assumption was verified 
by comparing the proposed ABDA with CSP, ACSP and BDA 
which omits the phase modulation. All methods were applied to 
the EEG data collected from 22 participants in RIT experiments 
on a single-trial basis. The classifiers were derived using target 



epochs and nontarget epochs of training data, and the single-trial 
classification results for comparison were obtained from testing 
data. It is noteworthy that 4 features were extracted by CSP/ 
ACSP using the most discriminative filters that had been derived 
and these features were fed to the conventional FED classifier. The 
performance measure adopted was the balanced accuracy (BA) 
[9], which accommodates the unbalanced sample sizes between 
targets and nontargets. BA is defined as. 
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Figure 4. The average ERPs of Participant P22 after spatial projection using BDA and ABDA method, (a) shows the projected signals of 
target condition, (b) shows the projected signals of nontarget condition. ABDA-real is the real part of the projected signals, while ABDA-imag is the 
imaginary part of the projected signals. 
doi:10.1371/journal.pone.0100097.g004 



balanced accuracy 

1 ^ correctly classified targets ^ correctly classified nontargets (13) 

2 targets nontargets 



Results and Discussion 

The classification results are shown in Table 1. It can be seen 
that ABDA outperformed other methods for 18 out of 22 
participants. The average BA achieved by ABDA was close to 
90% and was 5.9% higher than CSP, 3.9% higher than ACSP and 
2.5% higher than BDA, respectively. The better performance of 
ABDA over BDA was statistically significant in paired i-test, with 
j6-value<0.05, which however is not significant (/)-value = 0.017) in 
the Z-test with Bonferroni correction which is conservative. 
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Figure 5. The temporal courses of Participant P22's normalized 
temporal projections using BDA and ABDA methods. ABDA-real 
is the real part of the ABDA temporal projection, while ABDA-imag is 
the imaginary part of the ABDA temporal projection. 
doi:1 0.1 371/journal.pone.01 00097.g005 



Moreover, ABDA significantly surpassed CSP and ACSP, 
respectively, with /(-value < 0.001 in both paired i-test and Z-test 
with Bonferroni correction. On the other hand, though BDA 
significantly outperformed CSP (/)-value<0.01), its advantage over 
ACSP was insignificant (/)-value>0.15) in i-test with Bonferroni 
correction. 

According to (4) and (10), mi and v are the desirable projections 
that maximize the objective function. Although the derivations of 
w and V were complex-valued calculation, the ratios 
/(w,w * ,v,v * ) obtained during every iteration were real values 
(see Figure 2), as S^. and S/, (S„- and S/,) were semi-definite 
matrices. In Figure 2, it can be seen that for all the 22 participants, 
J(w,w * ,v,v * ) initially increased and would quickly converge to a 
constant value after several iteration steps. This indicates that the 
iterative learning was useful, and there always existed a pair of 
complex-valued bilinear projections which fulfilled the objective 
function. And most importantiy, these projections could be 
consistently and reliably obtained for all participants. 

The obtained ABDA spatial and temporal projections contained 
real parts and imaginary parts. According to Euler's formula, a 
complex value has a corresponding complex exponential function 
consisting of two variables, i.e. magnitude and phase. In particular, 
the normalized spatial projection for P22 was plotted and 
compared to the counterpart, i.e. the BDA spatial projection, in 
Figure 3. It can be seen that the phases of BDA spatial projection 
were binary. That is, they could only be either 0° or 180°, 
indicating positive or negative sign, respectively. In contrast, there 
was more flexibility in the ABDA spatial projection. As manifested 
in Figure 3, the ABDA phases ranged from —180° to 180°. This 
freedom allowed a delicate modulation of the phases of temporal 
signals in each channel. The phase modulation is very useful, as it 
can change the morphology of signals, including amplitude and 
latency, to improve classification. For instance, the P300, a major 
signature in this RIT task, propagates from frontal to parietal areas 
on the scalp [40]. Such latency differences among these spatially 
distributed EEG channels could be estimated and utilized for 
denosing [20] . It can also be applied to other EEG signals such as 
steady-state visual evoked potentials (SSVEPs) [41]. Modulating 
phases of every individual channel led to the increase in the 
number of channels that were important for discrimination. As can 
be seen in Figure 3, there were much more channels of high 
magnitude (high weight) for ABDA in comparison to BDA. In 
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addition, these critical channels were broadly spread such as 
midline, which is in accordance with the fact that P300 is 
measurable widely on the scalp. 

The modulation offered by ABDA has a noticeable conse- 
quence which can be observed in Figure 4. After being spatial 
projected, both real part and imaginary part of target ERPs 
showed a more prominent P300 component, as compared to that 
in the scenario of BDA. For nontarget ERPs, the real part and 
imaginary part were weaker than BDA, although the difference 
seemed to be less significant as those in Figure 4A. Therefore, the 
enlarged difference between target ERPs and nontarget ERPs in 
Figure 4 might imply that target condition and nontarget 
condition became more separable by the ABDA method. 
However, the overall performance is determined not only by the 
spatial projection but also the temporal projection. 

The normalized temporal projections of BDA and ABDA were 
plotted in Figure 5. Both BDA and ABDA temporal projections 
appeared to contain high frequency components. It may be due to 
the fact that, the temporal resolution (time point) was higher than 
the spatial resolution (the number of electrodes) and a regulari- 
zation term was not adopted during the iteration learning. From 
the perspective of the waveform, BDA imposed heavier weights on 
the first half of the time window, i.e. between 0 ms and 350 ms. 
These weights could be meaningful. As can be seen in Figure 4A, 
the projected target ERPs (blue line) peaked at 350 ms, which 
matched the corresponding weights in Figure 5, indicating a 
linkage between BDA temporal projection and BDA spatial 
projection. On the other hand, the ABDA method focused mainly 
on the late stage of the target ERPs, i.e. 400 ms, and the waveform 
looked very clean before 200 ms. This is in line with the work of 
Gerson et al. [5], where the prominent discriminating activities 



were observed after 350 ms. It also followed the peak of spatially 
projected signals in Figure 4A, in particular the ABDA-imag 
(green line). 

With the temporal projections as shown in Figure 5, the 
projected spatial topographies of two conditions were obtained in 
Figure 6. For both methods, the magnitude difference between 
target condition and nontarget condition were apparent, suggest- 
ing a strong discriminating capability of the temporal projection. 
With respect to the comparison between BDA and ABDA, there 
was a similar observation in Figure 3. That is, ABDA seemed to 
exploit a larger region for classification, from frontal, central to 
parietal and occipital, which was demonstrated by the magnitude 
mappings of BDA and ABDA under target condition. Further- 
more, it is interesting to note that the phase mapping of ABDA 
under target condition partially showed the gradual propagation 
pattern of target ERPs, e.g. P300. It is known that the latency of 
P300 is shorter over frontal areas and longer over parietal areas 
[42]. At the first glance, it seemed that there was a noticeable 
phase gap between frontal areas (dark red) and central areas (dark 
red). However, it is noteworthy that phase is periodic and 180° is 
equivalent to — 180°. Therefore, the phase difference between the 
frontal and the central areas was actually small. In general, it could 
be said that the color-coded phases of ABDA in Figure 6 
progressively changed from dark red, dark blue to yellow color 
along the scalp, resembling the P300 latency changes. The phase 
mapping in Figure 6 indicated the phase difference between the 
signals in EEG channels in a quantitative manner. Unlike ABDA, 
BDA did not account for the ERP propagation. For instance, in 
Figure 6 under target condition, all phases in the frontal and 
central areas were 180° (negative), and the rest were 0° (positive). 
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Figure 7. The plotting of average ERPs of Participant P22 and tlie outcomes after tKie spatio-temporal projection by the BDA and 
ABDA methods. The first row shows the target ERPs and nontarget ERPs along all the 62 channels, and are under the scale of [-5 5]. The other 
three rows are the element-wise products of ERPs (or the corresponding analytic representation) and the spatio-temporal projection. The scale is [— 
0.1 0.1]. ABDA-real is the real part of the ABDA element-wise products, while ABDA-imag is the imaginary part of the ABDA element-wise products. 
doi:1 0.1 371/journal.pone.01 00097.g007 
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Since there is an inherent relation between the iteratively 
optimized spatial projection and temporal projection, evaluating 
the spatial projection and temporal projection in a separate way 
may be insulTicient for viewing the big picture. Figure 7 illustrates 
the combined effect of bilinear projections. Given the ERPs (see 
the first row in Figure 7), the element-wise product for ABDA was 
calculated using. 

EP = conj(yi)conj(y * )oA (14) 

where conj(») stands for the conjugate function and » represents 
the element-wise product multiplication operator. The formula for 
BDA was simpler: 



EP = wv'oX (15) 

It is worth noting that the summation of all the elements of EP is 
equivalent to /) in (12). According to the EP of ABDA and EP of 
BDA in Figure 7, it can be seen that ABDA mainly relied on the 
late stage of target ERPs, whilst the early ERP components were 
favored by BDA. Moreover, a larger number of channels and time 
points were 'highlighted' by ABDA to distinguish target condition 
from nontarget condition. On the other hand, BDA depended on 
relatively limited spatio-temporal signal segments. Additionally, 
there is a kind of 'texture' at the first row of Figure 7, which can be 
attributed to the propagating process of ERPs on the scalp. Such a 



texture is also observable in the EP of BDA (the second row), 
which however, became absent in the EP of ABDA. The absence 
of this texture should be the result of the phase and magnitude 
modulation introduced by the complex-valued bilinear projec- 
tions, which counteracted the latency differences among channels. 

Conclusions 

In this study, ABDA, the analytic biUnear discriminant analysis, 
a linear discriminant analysis originated method, is proposed and 
has been applied to the development of the RIT system. The 
results showed that, without transforming into frequency domain, 
the ABDA method is capable of modulating the phases and 
magnitudes of slow ERP signals that are overlapping with other 
ERPs, using the coupling of complex-valued spatial projection and 
complex-valued temporal projection. The complex-valued bilinear 
projections accommodated the spatio-temporal phase variations of 
ERPs, and consequently enabled a better usage of high-density 
EEG measurement to perform the classification task. With the 
ABDA, the RIT tests have showed an average accuracy increase of 
2.5% over that with the BDA method and also outperformed CSP 
and ACSP. 

Author Contributions 

Conceived and designed the experiments: KY XL. Performed the 
experiments: KY. Analyzed the data: KY. Contributed reagents/ 
materials/analysis tools: KY XL. Wrote the paper: KY HAI NT. 



References 

1. Lcuthardt EC, Schalk (i, Moran D, OjcmannJG (2006) t he emerging world of 
motor ncuroprosthctics: a neurosurgical perspeetive. Neurosurgery .^9: 1—14. 

2. Hochbcrg LR, Serruya MD, Friehs GM, MulsandJA, Saleh M, et al. (2006) 
Neuronal ensemble control of prosthetic devices by a human with tetraplegia. 
Nature 442: 164-171. 

3. Lcudrardt EC, Schalk G, Wolpaw JR, Ojemann JG, Moran DW (2004) A 
brain— computer interface using electrocorticographic signals in humans. Jolunal 
of neural engineering 1: 63. 

4. Finke A, Lenhardt A, Ritter H (2009) The MindGame: a P300-based brain- 
computer interface game. Neural Networks 22: 1329-1333. 

5. Gerson AD, Parra LC, Sajda P (2006) Cortieally coupled computer vision for 
rapid image search. IEEE Trans Neural Syst Rehabil Eng 14: 174-179. 

6. tiughes G. Mathan S, Yeung N (2012) EE(j indices of reward motivation and 
target deteetability in a rapid visual detection task. Neurolmage. 

7. Huang Y, Erdogmus D, Pavel Mathan S, Hild KE (201 1) A framework for 
rapid visual image search using single-trial brain evoked responses. Ncurocom- 
puting 74: 2041-2051. 

8. Yu K, Shen K, Shao S, Ng WC, li X (201 2) Bilinear common spatial pattern for 
single-trial ERP-based rapid serial visual presentation triage. J Neural Eng 9; 
046013. 

9. Yu K, Shen K, Shao S, Ng WC, Kwok K, et al. (2011) Common spatio- 
temporal pattern for single-trial detection of event-related potential in rapid 
serial visual presentation triage. Biomedical Engineering, IEEE Transaction on 
5R: 2,')l;i-2,")20. 

10. Sajda P, Pohlmeyer E, Jun W, Parra LC, Christoforou C, et al. (2010) In a Blink 
of an Eye and a Switch of a Transistor: Cortieally Coupled Computer Vision. 
Proceedings of tire IEEE 98: 462-478. 

11. Pohlmeyer EA, WangJ, Jangraw DC, Lou B, Chang S-F, et al. (2011) Closing 
the loop in cortically-coupled computer vision: a brain-computer interface for 
searching image databases. Journal of neural engineering 8: 03602.5. 

12. Bernat E, Shevrin H, Snodgrass M (2001) Subliminal visual oddball stimuli 
evoke a P300 component. Clinical neurophysiology 112: 159—171. 

13. Polieh J, Criado JR (2006) Neuropsychology and neuropharmacology of P3a 
and P3b. International Journal of Psychophvsiologv 60: 172—185. 

14. Lagcrlund TD, Sharbrough EVV, Busacker NE (1997) Spatial filtering of 
multichannel elcctrocncephalographic recordings through principal component 
analysis by singular value decomposition. Journal of Clinical Neurophysiology 
14; 73-82. 

15. Makeig S, Bell AJ, Jung T-P, Sejnowski TJ (1996) Independent component 
analysis of electioencephalographic data. Advances in neural information 

processing systems: 14,5—151. 

16. Ramoser H, Mtjller-CierkingJ, Pfurtscheller G (2000) Optimal spatial filtering of 
single trial EEG during imagined hand movement. Rehabilitation Engineering, 
IEEE Transaction on 8: 441^6. 



17. Samek W, Vidaurre C, MijUer KR, Kawanabe M (2012) Stationary corrmion 
spatial patterns for brain-computer interfacing. J Neural Eng 9: 026013. 

18. Blankertz B, Tomioka R, Lemm S, Kawanabe M, Mtiller KR (2008) Optimizing 
Spatial filters for Robust EEG Single-Trial Analysis. IEEE Signal Processing 
Magazine 25: 41-56. 

19. DaSaUa CS, Kambara H, Sato M, Koike Y (2009) Single-tiial classification of 
vowel speech imagery using coirmion spatial patterns. Neural Networks 22: 
1334-1339. 

20. Yu K, Shen K, Shao S, Ng WC, Kwok K, et al. (2012) A spatio-temporal 
filtering approach to denoising of single-trial ERP in rapid image tiiage. 
J Neurosci Metiiods 204: 288-295. 

21. Lemm S, Blankertz B, Curio G, Mtiller KR (2005) Spatio-spectial filters for 
improving the classification of single trial EEG. IEEE Trans Biomed Eng 52: 
1541-1548. 

22. Dornhege (i, Blankertz B, Krauledat M, Eoseh F, Curio G, et al. (2006) 
Combined optimization of spatial and temporal filters for improving brain- 
computer interfacing. Biomedical Engineering, IEEE Transaction on 53: 2274— 
2281. 

23. Tomioka R, Dornhege G, Nolle G, Blankertz B, Aihara K, et al. (2006) 
Spectrally Weighted Common Spatial Pattern Algorithm for Single Trial EEG 
Classification. Dept. Math. Eng., Univ. Tokyo. 

24. Yu K, Wang Y, Shen K, li X (2013) The Synergy between Complex Channel- 
Specific FIR Filter and Spatial Filter for Single-Trial EEG Classification. PLoS 
ONE 8: e76923. 

25. Falzon O, Camilleri KP, Muscat J (2010) Complex-valued spatial filters for task 
discrimination. Conf Proc IEEE Eng Med Biol Soc 2010: 4707-4710. 

26. Falzon O, Camilleri K, Muscat J (2012) Complex-valued spatial filters for 
SSVEP-based BCIs with phase coding. IEEE Trans Biomed Eng 59: 2486- 
2495. 

27. VanRuUcn R, Busch N, Drewes J, Dubois J (2011) Ongoing EEG phase as a 
trial-by-trial predictor of perceptual and attentional variability. Frontiers in 
psychology 2. 

28. Kruglikov SY, Schiff SJ (2003) Interplay of electroencephalogram phase and 
auditory-evoked neural activity. Thejoumal of neuroscience 23: 10122-10127. 

29. Jia C, Gao X, Hong B, Gao S (2011) Frequency and Phase Mixed Coding in 
SSVEP-Based Brain— Computer Interface. Biomedical Engineering, IEEE 
Transactions on 58: 200-206. 

30. Lu H, Plataniotis KN, Vcnetsanopoulos AN (2011) A survey of multilinear 
subspace learning for tensor data. Pattern Recognition 44: 1540-1551. 

31. Li M, Yuan B (2005) 2D-LDA: A statistical linear discriminant aiialvsis f or image 
matrix. Pattern Recognition Letters 26: 527- 532. 

32. Visani M, Garcia C,JolionJM (2005) Normalized radial basis Itinetion networks 
and bilinear discriminant analysis for face recognition; 15—16 Sept. 342—347. 



PLOS ONE I www.plosone.org 



9 



June 2014 I Volume 9 | Issue 6 | e100097 



Analytic Bilinear Discrimination of EEG Signals 



33. Li J, Janardan R, Li Q_ (2004) Two-dimensional linear discriminant analysis. 
Advances in Neural Information Processing Systems 17: 1569—1576. 

34. Miwakciehi ¥, Martinez-Montes E, Valdes-Sosa PA, Nishiyama N, Mizuhara H, 
et al. (2004) Decomposing EEG data into space-time-frequency components 
using parallel factor analysis. Neurolmage 22: 1035—1045. 

35. Li J, Zhang L, Tao D, Sun H, Zhao Q_ (2009) A prior neurophysiologic 
knowledge free tensor-based scheme for single trial EEG classification. Neural 
Systems and Rehabilitation Engineering, IEEE Transaelions on 17: 107 115. 

36. Xu P, Yang P, Lei X, Yao D (201 1) An Enhanced Probabilistic EDA for Multi- 
Class Brain Computer Interface. PLoS ONE 6: el4634. 

37. Zhang R, Xu P, Guo L, Zhang Y, Li P, et al. (2013) Z-Scorc Linear 
Discriminant Analysis for EEG Based Brain-Computer Interfaces. PLoS ONE 8: 
e74433. 



38. Duda RO, Hart PL, Stork DG (2012) Pattern classificadon: Wiley-intersciencc. 

39. McCleeryJP, Surtees AD, Graham KA, Richards JE, Apperly lA (2011) The 
neural and cognitive time course of theory of mind. The Journal of Neuroscience 
31: 12849-12854. 

40. PoHchJ, MargEila C {1997} P300 and probability: comparison of oddball and 
single-stimulus paradigms. IntemationEil journal of psychophysiology: official 
journal of the International Organization of Psychophysiology 25: 169. 

41. Falzon O, Camilleri KP, Muscat J (2012) The analytic common spatial patterns 
method for EEG-based BCI data. Journal of Neural Engineering 9: 045009. 

42. Mertcns R, PolichJ (1997) P300 from a single-stimulus paradigm: passive versus 
active tasks and stimulus modality. Electroencephalography and CHnical 
Neurophysiology/Evoked Potentials Section 104: 488-^97. 



PLOS ONE I www.plosone.org 



10 



June 2014 I Volume 9 | Issue 6 | e100097 



